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Abstract 

Consider designing a transportation network on n vertices in the plane, 
with traffic demand uniform over all source-destination pairs. Suppose the 
cost of a link of length I and capacity c scales as tcP for fixed < (3 < 1. 
Under appropriate standardization, the cost of the minimum cost Gilbert 
network grows essentially as n°^', where a(/3) = 1 — ^ on < /3 < | 
and ct(j3) = i + jOn|</3<l. This quantity is an upper bound 
in the worst case (of vertex positions), and a lower bound under mild 
regularity assumptions. Essentially the same bounds hold if we constrain 
the network to be efficient in the sense that average route-length is only 
1 + o(l) times average straight line length. The transition at (5 — | 
corresponds to the dominant cost contribution changing from short links 
to long links. The upper bounds arise in the following type of hierarchical 
networks, which are therefore optimal in an order of magnitude sense. 
On the large scale, use a sparse Poisson line process to provide long- 
range links. On the medium scale, use hierachical routing on the square 
lattice. On the small scale, link vertices directly to medium-grid points. 
We discuss one of many possible variant models, in which links also have 
a designed maximum speed s and the cost becomes le* s 1 . 



1 Introduction 

To design a transportation network linking specified points (visualized as cities) 
in the plane, one might specify a cost functional and a benefit functional on all 
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possible networks, and then consider networks which are optimal in the sense of 
minimizing cost for a given level of benefit. This paper addresses one particular 
choice of functionals, but our broader purpose (see section 1.1) is to draw the 
attention of statistical physicists to this class of problem. 

We study a simple model involving the "economy of scale" idea 

One link of length £ and capacity 2c is less than twice as expensive 
as two links of length £ and capacity c. 

We capture this idea by specifying that the cost of a link of length t and capacity 
c scales as id 3 for some < j3 < 1. In the real world, network designers do not 
know in advance what traffic demand will be. We simplify by assuming that 
traffic demand is known (and uniform over all source-destination pairs) and 
routes are controlled, so that the volume /(e) of flow across an edge (link) e 
can be determined by the designers, and the corresponding link-capacity built. 
(Visualize links as roads, and flow- volume /(e) as "number of vehicles per hour" . 
We are ignoring stochastic fluctuations in traffic). Thus our cost structure is 

cost of network = ^f(e)/ /3 (e) (1) 

e 

where £{e) = length of link e. 

To define the model carefully, write x™ = {x\, X2, ■ ■ ■ , x n } for a configuration 
of n vertices in the square [0,n 1/<2 ] 2 of area n. So Xi is the position of vertex i. 
Create a connected network G(x") by adding links: links are line-segments with 
their natural Euclidean lengths, and links may meet at places not in the given 
vertex-set x". To make the distinction clear let us refer to the given n vertices 
as cities and any meeting places (which depend on our choice of network) as 
junctions. Between each source-destination pair of cities, flow of volume 
n~ 3 / 2 (this scaling is explained below) is routed through the network. Define 
cost(G(x™)), the cost of the network, via (1). This setting specializes a setting 
considered by Gilbert [1], and we call the minimum-cost network the Gilbert 
network Gil(x"). See [2] for general properties of, and heuristic algorithms for, 
Gilbert networks over deterministic points. 

Gilbert networks may be optimal from a network operator viewpoint, but 
what about a network user? Write £(xi,Xj) for route-length, and \xj — Xi\ for 
straight-line distance, between cities i and j. For a typical configuration, the 
average distance I will be order n 1 / 2 . The kind of "benefit to users" 

we have in mind is that the network provides routes almost as short as possible. 
So we call the sequence of networks (G(x™)) modestly efficient if 

&veij(£(xi,Xj) - \xj - Xi\) = o(n 1/2 ). (2) 

The name reflects the remarkable fact [3] that there exist extremely efficient 
networks for which this average is O(logn) while their length is only 1 + o(l) 
times the minimum length of any connected network; such results pay no at- 
tention to flow-volumes or capacities, and so constitute the (3 = case of the 
present model. The problem we address in this paper is: 
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given the sequence (x™), how small can we make cost(G(x™)) subject 
to the modestly efficient constraint (2)? 

In the (3 = case just mentioned, we can make cost(G(x™)) be asymptotically 
the length of the Steiner tree (minimum length connected network) on x™, which 
is well known to be 0(n) in the worst case and in the typical case. Recall that 
a„ = 0(b n ) means that a n /b n is bounded as n — > oo. It is often convenient to 
write the converse relationship b n = 0(a = 0(6„); if both a n = 0(b n ) 

and a n = £l(b n ) then we write a n = 0(6„). 

In the case (3 = 1 there is no "economy of scale" and so the minimum-cost 
network is just the complete graph, that is a direct link between each pair of 
cities. The associated cost is 

* 3 

which is 0{n) in the worst case and in the typical case. 

Recall that the Gilbert network Gil(x") is the minimum-cost network when 
there is no extra "modestly efficient" constraint. Theorem 1 shows that im- 
posing the "modestly efficient" constraint makes little difference in an order of 
magnitude sense: in either case the optimal cost grows roughly as order n a ^\ 

Theorem 1 Fix < (3 < 1 . Define 

a(/3) = 1-f, 0<(3<\ 

= |</3<i. 

Let x™ be a configuration of n cities in the square [0,n 1//2 ] 2 . 

(a) Case < (3 < |. There exist modestly efficient networks for which 
cost(G(x™)) = 0(n a( ^) (except for (3 = 1/2 the bound is 0(n 3 / 4 logn)J. Under 
the technical assumption (7) there do not exist connected networks for which 
cost(G(x")) = o(n a W). So under (7) we have cost(Gil(x™)) = 9(n a W) for 
(3<\. 

(b) Case i < /3 < 1. Here cost (Gil fx")) = 0(n a ^). Given lu„ -» oo arbi- 
trarily slowly, there exist modestly efficient networks for which cost(G(x™)) = 
0{uj n n a ^). Under the technical assumption (8), cost ( Gil (x™)) = <d{n a{ -^), but 
there do not exist modestly efficient networks for which cost(G(x™)) = 0(n a ^). 

Our discussion above of the cases (3 = and (3=1 implies corresponding results 
in these cases with a(0) = 1 and a{l) = 1. 

The transition at f3 = | corresponds to the dominant cost contribution 
changing from short links to long links, as we will explain in section 2.5. The 
technical regularity assumptions that we need to impose to obtain lower bounds 
reflect this transition: for (3 < 1/2 we need to assume that nearest-neighbor 
distances are not atypically small, whereas for (3 > 1/2 we assume a large-scale 
cquidistribution of the city configuration. (We defer statements of these assump- 
tions until the place they are actually used in the proof, to avoid interrupting 
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the conceptual discussion here.) We show (section 2) that the upper bounds 
arise in the following type of hierarchical networks, which are therefore optimal 
in an order of magnitude sense. On the large scale, use a sparse Poisson line 
process to provide long-range links. On the medium scale, use hierachical rout- 
ing on the square lattice. On the small scale, link cities directly to medium-grid 
points. It is perhaps counter-intuitive that one can use the same network for 
the whole range of (3; the point is that only the medium-small scale structure 
really matters for (3 < 1/2 and only the large scale structure really matters for 
(3 > 1/2. Our arguments implicitly imply some weak properties of the exactly 
optimal networks. Undcstanding in detail the structure of the Gilbert network 
(or the asymptotically optimal modestly efficient network) over random points 
in the critical case (3 = 1/2 is a challenging problem, interesting because one 
expects the network to have some scale-free stucture, in the (correct) sense of 
invariance under spatial and flow-volume rescaling. 

One can imagine many variant models in which extra structure is incorpo- 
rated. In section 4 we briefly discuss the case where links have designed speed s 
and where the cost of a link becomes ic^ 1 's 7 ; in this case an analog of Theorem 
1 remains true. 

1.1 Optimal spatial network design methodology 

This paper contributes to a general program concerning networks linking points 
in the plane: 

for mathematically simple cost/benefit functionals, study the prop- 
erties (geometry, cost and benefit values) of optimal networks as the 
number n of points tends to infinity. 

Network design problems arise in many applied fields, but serious real-world 
modelling leads to more complicated functionals tuned to specific applications 
than we have in mind. As complementary work, [3] gives a detailed treat- 
ment of the extremely efficient networks mentioned above that minimize av- 
erage route length subject to total network length; and [4] analyzes a model 
(for e.g. passenger air travel or package delivery) where there is a substantial 
cost to transfer from one link to another. In the latter model, theory predicts 
that hub-and-spoke networks (as seen in the real world) are near-optimal and 
that, constraining the average number of transfers to be say 2, the length of the 
shortest possible network scales as n 13 / 10 . 

The methodological feature we want to emphasize concerns models for the 
position of n cities (assumed for simplicity in a square of area n). In each 
problem we have studied one gets the same order of magnitude for optimal 
network cost for worst-case positions as one gets for arbitrary positions (under 
mild assumptions) and in particular the same as for random positions or for 
regular (e.g. lattice) positions. 

The bulk of statistical physics literature on spatial networks (surveyed in 
[5]) analyzes networks built according to some specific probability model which 
combines ingredients such as 



4 



(a) geometric random graphs (link probability depends on inter- vertex distance); 

(b) proportional attachment probabilities for arriving vertices; 

(c) prescribed power law distribution of lattice vertex degrees; 

(d) networks based on recursive partitioning of space. 

This theoretical literature makes passing reference to optimality, but we have 
not seen analytic results demonstrating optimality over all possible networks in 
the spatial context (see [6] for non-spatial results, and [7, 8] for assumptions 
under which optimal networks are trees). For interesting empirical work sec [9]. 

Our scaling conventions (a square of area n; flow- volume n~ 3//2 between each 
source-destination pair) may seem arbitrary but are chosen to fit the following 
standardizations: 

(i) cities have density 1 per unit area; 

(ii) flow volume across unit area is order 1. 



2 The construction 

A network satisfying the requirements of Theorem 1 will be constructed in 
section 2.3 using mathematical ingredients described in sections 2.1 and 2.2. 
Figure 1 illustrates the construction. 




Figure 1. Ingredients of the construction. Left: the hierarchical routing 
lattice, with higher-type edges indicated by thicker lines, and a typical route shown. 
Right: the large-scale grid and the Poisson line process. 



2.1 Hierarchical routing on the square lattice 

Fix M and consider the square grid on vertices {0, 1,2,..., 2 M — l} 2 . Declare 
lines (and their edges) to be of some type 0,1,2, ... ,M according to the rule: 
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the horizontal lines {(a;, y) : y = (2j — l)2 m }, j = 1, 2, . . . are type m 

the boundary line {(x, 0)} is type M; 
and similarly for vertical lines. For each vertex (x, y), define a route from (x, y) 
to (0, 0) using only downward and leftward edges as follows. First choose the 
edge at (x,y) of higher type (breaking ties arbitrarily). Then repeat the rule 

Follow the current edge until it crosses an edge of strictly higher 
type, then transfer to that edge 

until reaching (0,0). See Figure 1, left side. 
It is elementary to verify 

Lemma 2 For each < m < M, the number of type-m edges traversed by the 
route is at most 2 m+1 . 

2.2 The Poisson line process 

A line in the plane may be parametrized by the point z on the line which is 
closest to the origin (so the line segment from the origin to z is orthogonal to 
the line); then write z in radial coordinates as {r,9). Recall [10] the notion 
of a Poisson line process (PLP) of intensity rj > 0, which makes precise the 
notion of "completely random" lines in the plane. Parametrizing lines by by 
their closest points (r, 6) , this PLP has intensity rj with respect to Lebesgue 
measure on parameter space (0, oo) x (0, 2n). The PLP distribution is invariant 
under Euclidean transformations, and for a fixed set A 

E(lcngth of line segments intersecting A) = nr] x area(A). (3) 

(We write E for expectation and P for probability). The next result shows how 
the PLP is useful in constructing spatial networks. See Figure 1, right side. 

Lemma 3 Let n 1 / 2 /a n be an integer. Construct a network as the superposition 
of the rectangular grid with cell side-length o~ n and the Poisson line process of 
intensity rj, intersected with the square [O,?! 1 / 2 ] 2 . Let Vi,Vj be vertices of the 
grid. Then 

K(route-length Vi to Vj) < \vi — Vj\ + Ci ^ log(?7V / 2n) 
for an absolute constant Ci. 

Lemma 3 is proved in [3], Lemma 11, and we will not repeat the argument here. 
(In essence, one analyzes the natural routing algorithm: move to a nearby line 
of the PLP, move along that line in the direction closer to the direction of the 
destination city, and when encountering another line of the PLP, switch to that 
line if its direction is closer to the destination city direction). Using the PLP 
gives us random networks, but a typical realization will have costs and lengths 
of the same order as the expectations in our formulas. 
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2.3 Construction of the networks 



We now describe how the ingredients above (hierarchical routing on the square 
lattice, the PLP) are used in a network construction. Recall x" denotes the 
given configuration of n cities. Take integers 9 n | oo slowly and define 

<r n = n^ 2 /9 n . 

Let M n be the integer such that 

a n /2 < 2 M " < a n . 

Define 

s„ = cr„/2 M ™, (sol<s„<2). 

Construct a network G(x") as follows. 

(i) Take the large-scale network in Lemma 3, with r\ n = Q n nT x ^ 2 . This 
network contains large cells of side-length a n . 

(ii) Inside each large cell put a copy of the hierarchical routing lattice of 
section 2.1, with M = M n , and scaled so that the basic small cell of this lattice 
has side- length s n . 

(hi) Link each city x <G x" via a straight edge to the bottom left corner 
vertex v(x) of its small cell. 

Figure 1 illustrates (i) and (ii). There is a natural way to define a route from 
Xi to Xj in this network. From X{ take the link to v(xi), then follow the section 
2.1 routing scheme to the lower left corner V(xi) of the large cell; navigate from 
V(xi) to V(xj) via the shortest route in the Lemma 3 graph. 

Note that in addition to the given n cities, this network has several different 
kinds of junctions: the vertices of the grid, and places where lines of the PLP 
cross each other or cross the grid lines or cross the short stage (iii) links. In our 
model there is no cost associated with creating a junction or with routes using 
junctions; the costs involve only link lengths and route lengths. So the exact 
number of junctions is unimportant. 

2.4 Analysis of the networks 

Clearly 

l(x,,x 3 ) < l{y(x l ),V{x j )) + 2^ 2 a n 

and so by Lemma 3 

Ee(xi,xj) < \xi -xj\ + 2 3 / 2 a„ + Ca^ log(77„\/2^). 

From the definitions of a n , rj n we see 

E(e{xi,Xj) - \xi-Xj\) = o(n 1/2 ) 

establishing the modestly efficient property. 

To analyze costs, we treat stages (i)— (iii) separately, and check that each 
stage cost is less than the bounds stated in Theorem 1. 
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Stage (in). There are n links of the form (x, v(x)), each carrying flow volume 
2(1 — i)n -1 / 2 , and each having length at most s n ^/2, and so 

the total cost of stage (iii) links is 0{n 1 ~^). (4) 

Stage (ii). Now let £ m be the set of type-m edges. The number of such 
edges is #£ m = 0(n2~ m ). Recall that Holder's inequality shows that for any 
edge-set £ 

ee£ \ee£ 
Now 

E /(e) = 2n~ 1 / 2 E #{ type-m edges in route v(x) to V(x)} < 2 m+2 n 1 / 2 
ee£ m x ^ n 

using Lemma 2. Thus 

E /"(e) = O ((n2- m ) 1 -' 3 (2 m n 1 / 2 )' 3 ) = O (n 1 -^ 2 ^ 1 )) . (5) 

eef m 

Writing £ me d for all edges in the copies of the hierarchical routing lattice, we 
find after summing over < m < M 



E f(e) = Oft-*), 0</3<i 
= o(n 3 / 4 logn), (3=\ 

= 0(n 1 -h M W-V)=0(n^), \<P<\ 



using 2 M < n 1 / 2 . Because edge- lengths here are s n < 2, these are bounds for 
the costs associated with stage (ii). 

Stage (iii). Write £i ar ge for the set of links of the large-scale network, that 
is the large-scale grid and the PLP lines. Flow along the route from V(xi) to 
V(xj) contributes n~ 3 / 2 £(V(xi), V(xj)) to the "flow x distance" measure, and 
so 

O large j j 

where the left side denotes integrating along all links of the large-scale network. 
By the already-established modestly efficient property, 

E E e ( y (*o. v( Xj )) = (i+ (i))EE^-^i = °(" 5/2 ) 

i j i j 

and so 

f(e)de = 0(n). 



rge 
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The total length L n of £i arge is the sum of O{n}/ 2 n ) (= contribution from 
large-scale grid) and 0(r] n n) (= contribution from the PLP, using (3)), and so 
L n = Oir^l^Qn). The integral form of Holder's inequality now shows that the 
cost associated with £i ar ge is : 

f f(e)de<L^x(r f(e)de) = O fa-W+M) . (6) 

J C large \^Ol arge J 

Examining the cost of each stage, we check that the modestly efficient network 
we have constructed has its cost bounded as stated in Theorem 1. Moreover, 
if we eliminate the "modestly efficient" constraint then we can eliminate Stage 
(hi) of the construction (take 9 n = 1) and get the stated 0(n a ^) upper bound. 

2.5 The transition at (3 = 1/2 

To summarize, the costs associated with the constructed networks arising from 
short, medium and large-scale links are bounded by expressions (4,5,6) respec- 
tively. By examining the exponents of n we see that the transition at [3 = i 
corresponds to the dominant cost contribution changing from short links to long 
links. The arguments we give below for the lower bound show this is a genuine 
effect (no alternate networks can do essentially better), not an artifact of the 
particular networks contructed above. 

3 The lower bound 

In the settings of [3, 4] the lower bounds require some effort to prove, but in the 
present setting the proofs are short. 

3.1 The case < f3 < 1/2 

Consider first the case < (3 < 1/2. Impose the condition: there exists some 
small 5 > such that 

for at least 5n of the cities of x™ , 

the distance to the nearest neighbor is at least 5. (7) 

Consider a city i£x" satisfying this condition, and consider the link-segments 
of an arbitrary connected network within distance 8/2 from x. Because flow 
of volume 2n~ 1//2 must enter or leave x, the cost associated with these link- 
segments (which by concavity of / — > f 13 is minimized when there is a single 
link-segment) is at least 5/2 x (2n -1 / 2 )^. Summing over all (there are at least 
Sn) such cities x, noting the link-segments are distinct as x varies, the network 
cost is at least Sn x 6/2 x (2n" 1 / 2 ) /3 = f^n 1 ^). 
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3.2 The case 1/2 < (3 < 1 

In the case l/2</3<lwe impose the classical equidistribution property for 
the configuration x™ = (x™, 1 < i < n) rescaled back to the unit square: 

the empirical distribution of {n~ 1 ' 2 x™, 1 < i < n} converges 

in distribution to the uniform distribution on [0, l] 2 . (8) 

Our standardization conventions imply that the total volume of flow through 
the network is 0(n 1 / 2 ) and so assertion (a) below is obvious. 

Lemma 4 (a) In the Gilbert network Gil(x"), the maximum edge-flow is bounded 
as 

max /(e) = 0(n 1/2 ). 

e 

(b) For any modestly efficient network (G(x™)) on configurations satisfying the 
equidistribution condition (8), the maximum edge-flow is bounded as 

max/(e) = o(n 1//2 ). 

e 

Granted this result, use the fact 

y^l(e)/(e) > ™~ 3 ^ 2 ^^ \x% — Xj\ = 6(n) by equidistribution 

e i j 

and the general inequality 

to deduce that cost(G(x")) grows strictly faster than njn^^l 2 = ti"^' for 
any modestly efficient network, and no slower than order n a ^' for the Gilbert 
network. 

Proof of Lemma 4(b). We first quote an easy fact from geometry. 

Lemma 5 Let Z\ , Z 2 be two independent uniform random points in the unit 
square [0, l] 2 . There exists a constant C such that for all x e [0, l] 2 and all 
5 > 

PflZi - x\ + \Z 2 -x\< \Z X - Z 2 \ +8)< OS 1 ' 2 . 

Now fix S > 0. Write Xi,X 2 for two uniform random picks from the set x" of 
cities. The modestly efficient assumption implies 

¥(e(X u X 2 ) > |Xi -X 2 \ + 5n 1/2 ) -^OasTi^oo. 

Lemma 5 and the equidistribution assumption (8) imply 

F(\X 1 -x\ + \X 2 -x\<\X l -X 2 \+8 for all x) < CS 1/2 + o(l). 
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In order for the route from X\ to X2 to pass through point x, one of the two 
inequalities above must hold, and so 

supP( route X\ to X 2 passes through x ) < Cd 1 ^ 2 + o(l). 

X 

But 5 is arbitrary, so this probability is o(l), and the flow volume is exactly 
n 1 / 2 times this probability. 

4 Associating speeds with links 

The main feature of our model - that the cost of building a link is sublinear in 
link capacity - is just one of many realistic features one might want to incorpo- 
rate into a model. By focussing on route lengths, we have implicitly assumed 
that users travel at constant speed. A notable feature of real road or rail net- 
works is that different links permit different speeds. In this section we state 
and briefly discuss a variant model in which links can be designed to permit 
different speeds. 

Suppose a link with length £, nominal capacity Co and nominal speed So costs 
VcqSq, for fixed < 7 < 00. On such a link, traffic moves with speed so provided 
the flow-volume / is at most Co; for larger flow-volumes, congestion causes the 
speed to drop, reaching speed zero (jammed) at volume <rc for a constant a. 
So aco is the maximum capacity. Precisely, 

speed at flow- volume / = soG(f /cq) 

where G(u) = 1 for < u < 1 and G(u) decreases from 1 to as u increases from 
1 to o . Otherwise the model is the same as before: we are given a configuration 
of n cities in the square of area n, and we are required to route flow of volume 
n~ 3 / 2 between each source-destination pair. 

For any network and feasible routing, define average speed as 

- avej j\Xi Xj 

speed = ^— - - — J - 

where t(xt,Xj) is the time taken to travel from Xi to Xj. For this model, we ask 

What is the minimum cost for a network on a given configuration 
x™ of cities that allows speed = s? 

The answer is that, under the regularity assumptions of Theorem 1 (which are 
needed only for lower bounds), and ignoring O(logn) terms. 

minimum cost grows as order s 7 n a *^' 7 \ where (9) 

<**(/?, 7) - 1 - f - i, 0< 2/3 + 7 <1 
= l<2/3 + 7 . 
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Let us briefly indicate how the previous analysis is adapted to this setting. 
Because costs scale with design speed So as Sq, it is enough to consider the case 
speed = 1, and show that minimum cost grows as order n a To construct 

a network, use the networks constructed previously and assign design speeds as 
follows. For links of the large-scale network, which routes will use for a distance 
of order n 1 / 2 , design speed of order 1. For type m edges in the hierarchical 
routing lattice, which routes will use for a distance of order 2 m , design speed 
of order 2 m n _1 / 2 logn. For the local links of the form (x,v(x)), which routes 
will use for distance 0(1), design speed of order n -1 / 2 . This ensures the typical 
times t(xi,Xj) are of order n 1 / 2 as required. To calculate the cost, we simply 
combine the previous estimates (4,5,6) of costs of providing flow- volumes of 
different links with the costs of the design speeds stipulated above; the total 
cost is of order 

M 

n i-# x n~i + ]T „i-f 2 m < 2 "- 1 > x (2 m n- 1 / 2 logn)^ + n^' 2 x 1 

m=0 

and this works out to be of the form n a stated. 
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